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Abstract 

The study of the interplay between the testability of properties of Boolean functions and 
the invariances acting on their domain which preserve the property was initiated by Kaufman 
and Sudan (STOC 2008). Invariance with respect to F 2 -linear transformations is arguably the 
most common symmetry exhibited by natural properties of Boolean functions on the hypcrcubc. 
Hence, an important goal in Property Testing is to describe necessary and sufficient conditions 
for the testability of linear-invariant properties. This direction was explicitly proposed for in- 
vestigation in a recent survey of Sudan. We obtain the following results: 

1. We show that every linear- invariant property that can be characterized by forbidding 
induced solutions to a (possibly infinite) set of linear equations can be tested with one- 
sided error. 

2. We show that every linear-invariant property that can be tested with one-sided error can 
be characterized by forbidding induced solutions to a (possibly infinite) set of systems of 
linear equations. 

We conjecture that our result from item (1) can be extended to cover systems of linear equa- 
tions. We further show that the validity of this conjecture would have the following implications: 

1. It would imply that every linear- invariant property that is closed under restrictions to 
linear subspaces is testable with one-sided error. Such a result would unify several previous 
results on testing Boolean functions, such as the testability of low-degree polynomials and 
of Fourier dimensionality. 

2. It would imply that a linear- invariant property V is testable with one-sided error if and 
only if V is closed under restrictions to linear subspaces, thus resolving Sudan's problem. 
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1 Introduction 



Let V be a property of Boolean functions. A testing algorithm for V is a randomized algorithm 
that can quickly distinguish between the case that / satisfies V from the case that / is far from 
satisfying V . The problem of characterizing the properties of Boolean functions for which such an 
efficient algorithm exists is considered by many to be the most important open problem in this 
area. Since a complete characterization seems to be out of reach, several researchers have recently 
considered the problem of characterizing the testable properties V that belong to certain "natural" 
subfamilies of properties. One such family that has been extensively studied is the family of so 
called linear-invariant properties. Our main result is two fold. We first show that every property 
in a large family of linear-invariant properties is indeed testable. Next, we conjecture that an 
even more general family of properties can be tested and show that such a result would give a 
characterization of the linear-invariant properties that are testable with one-sided error. 

1.1 Background on property testing 

We start with the formal definitions related to testing Boolean functions. Let V be a property of 
Boolean functions over the n-dimensional Boolean hypercube. In other words, V is simply a subset 
of the set of functions / : {0, l} n -4 {0, 1}. Two functions /, g : {0, l} n — > {0, 1} are e-far if they 
differ on at least e2 n of the inputs. We say that / is e-far from satisfying a property V if it is e-far 
from any function g satisfying V . A tester for the property V is a randomized algorithm which 
can quickly distinguish between the case that an input function / satisfies V from the case that 
it is e-far from satisfying V . Here we assume that the input function / is given to the tester as 
an oracle, that is, the tester can ask an oracle for the value of the input functions / on a certain 
x E {0, l} n . We say that V is strongly testable (or simply testable) if V has a tester which makes 
only a constant number of queries to the oracle, where this constant can depend on e but should be 
independent 1 of n. Finally, we say that a testing algorithm has one-sided error if it always accepts 
input functions satisfying V. (We always demand that the tester rejects input functions which are 
e-far from satisfying V with probability at least, say, 2/3.) 

The study of testing of Boolean functions began with the work of Blum, Luby and Rubinfeld 
[BLR93] on testing linearity of Boolean functions. This work was further extended by Rubinfeld 
and Sudan [RS96]. Around the same time, Babai, Fortnow and Lund [BFL91] also studied similar 
problems as part of their work on MIP=NEXP. These works are all related to the PCP Theorem, 
and an important part of it involves tasks which are similar in nature to testing properties of 
Boolean functions. The work of Goldreich, Goldwasser and Ron [GGR98] extended these results 
to more combinatorial settings, and initiated the study of similar problems in various areas. More 
recently, numerous testing questions in the Boolean functions settings have sparked great interest: 
testing dictators [PRS02], low-degree polynomials [AKK+05, Sam07], juntas [FKR+04, Bla09], 
concise representations [DLM+07], halfspaces [MORS09], codes [KL05, KS07, KS09]. These are 
documented in several surveys [Fis04, Rub06, Ron08, SudlO], and we refer the reader to these 
surveys for more background and references on property testing. 

1 Observe that since we aim for asymptotic results (that is, we think of n — > co), our property V can actually be 
described as V — Ufei^ni where V n is the collection of functions over the n-dimensional Boolean hypercube which 
satisfy V ■ 
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1.2 Invariance in testing Boolean functions 

What features of a property make it testable? One area in which this question is relatively well 
understood is testing properties of dense graphs [AS08a, AFNS06, BCL+06]. In sharp contrast, this 
question is far from being well understood in the case of testing properties of Boolean functions. 
In an attempt to remedy this, Sudan and several coauthors [KS08, GKS08, GKS09, BS09] have 
recently begun to investigate the role of invariance in property testing. The idea is that in order 
to be able to test if a combinatorial structure satisfies a property using very few queries to its 
representation, the property we are trying to test must be closed under certain transformations. 
For example, when testing properties of dense graphs, we are allowed to ask if two vertices i and 
j are adjacent in the graph, and the assumption is that the property we are testing is invariant 
under renaming of the vertices. In other words, if we think of the input as an dimensional 
0/1 vector encoding the adjacency matrix of the input, then the property should be closed under 
transformations (of the edges) which result from permuting the vertices of the graph. 

A natural notion of invariance that one can consider when studying Boolean functions over 
the hypercube is linear-invariance, which is in some sense the analogue for graph properties being 
closed under renaming of the vertices (we further discuss this analogy in Subsection 1.3). Formally, 
a property of Boolean functions V is said to be linear-invariant if for every function / : FJ? — )■ {0, 1} 
satisfying V and for any F2-linear transformation L : FJ? — > the function / o L satisfies V as well, 
where we define (/ o L)(x) = f(L(x)). Note that here we identify {0, l} n with F^, and we will use 
this convention from now on throughout the paper. For a thorough discussion of the importance 
of linear-invariance, we refer the reader to Sudan's recent survey on the subject [SudlO] and to the 
paper of Kaufman and Sudan which initiated this line of work [KS08] . 

1.3 The main result 

Our main result in this paper (stated in Theorem 3 below) is that a natural family of linear-invariant 
properties of Boolean functions can all be tested with one-sided error. The statement requires some 
preparation. 

Definition 1 ((M, cr)-free) Given an mx k matrix M over ¥2 and a G {0, l} k for integers m > 
and k > 2, we say that a function f : FJ; — >■ {0, 1} is (M, cr)-free if there is no x = (x±, . . . , x^j G 
(F?^ such that Mx = and for all 1 < i < k we have f(xi) = o~i. 

Remark: By removing linearly dependent rows, we can ensure that rank(M) = m without loss 
of generality. We will assume this fact henceforth. 

Let us give some intuition about the above definition. Given a function / : FJ> — > {0, 1}, it is 
natural to consider the set Sf = {x G FJ> : f(x) = 1}. Suppose for the rest of this paragraph that 
in the above definition a = l k . In this case / is (M, a)-free if and only if Sf contains no solution to 
the system of equations Mx = 0, that is, if there is no v G Sf satisfying Mv = 0. Note that when 
considering graph properties, the notion of (M, l fc )-freeness is analogous to the graph property of 
being H-ivee 2 , where H is some fixed graph. Observe that in both cases the property is monotone 
in the sense that if / is (M, l fc )-free, then removing elements from Sf results in a set that contains 

2 If H is a graph on h vertices, then we say that a graph G is H-iree if G contains no set of h vertices that contain 
a copy of H (possibly with some other edges). 
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no solution to Mx = 0. Similarly if G is H-free, then removing edges from G results in an H-free 
graph. 

Let us now go back to considering arbitrary a € {0, l} k in Definition 1, where again the intuition 
comes from graph properties. Observe that a natural variant of the monotone graph property of 
being H-free is the property of being induced H-free 3 . Note that being induced H-free is no longer 
a monotone property since if G is induced H-free then removing an edge can actually create induced 
copies of H. Getting back to the property of being (M, cr)-free, observe that we can think of this 
as requiring Sf to contain no induced solution to the system of equations Mx = 0. That is, the 
requirement is that there should be no vector v satisfying Mv = 0, where Vi G St if <7j = 1 and 
Vi € ¥2 \ Sf if <Ji = 0. So we can think of a as encoding which elements of a potential solution 
vector v should belong to Sf and which should belong to its complement. For this reason we will 
adopt the convention of calling (M, a) a forbidden induced system of equations. 

Continuing with the graph analogy, once we have the property of being induced H-free, for some 
fixed graph H, it is natural to consider the property of being induced H-free where H is a fixed 
finite set of graphs. Several natural graph properties can be described as being induced 'H-free 
(e.g. being a line-graph), but it is of course natural to further generalize this notion and allow 
7i to contain an infinite number of forbidden induced graphs. One then gets a very rich family 
of properties like being Perfect, /c-colorable, Interval, Chordal etc. This generalization naturally 
motivates the following definition which will be key to our main results. 

Definition 2 (J-~-free) Let T = {{M , a l ),(M 2 , a 2 ),. . . } be a (possibly infinite) set of induced 
systems of linear equations. A function f is said to be J- -free if it is {M l , a 1 ) -free 4 for all i. 

Observe that this definition is an OR- AND type restriction, that is, we require that / will not 
satisfy any of the systems (M s , a 1 ), where / satisfies {M l , a 1 ) if it satisfies all the equations of M % 
(in the sense of Definition 1). We are now ready to state our main result. 

Theorem 3 (Main Result) Let T = {(M 1 , a 1 ), (M 2 , a 2 ), . . . } be a possibly infinite set of induced 
equations (that is, all the matrices M l are of rank one), each on more than two variables. Then 
the property of being J- -free is testable with one-sided error. 

Note that, in the above statement, each M l contains a single equation, rather than a system 
of equations as in Definition 2. In fact, though, what we prove is quite a bit stronger: Theorem 3 
holds when each M l is of complexity 1, instead of just rank 1. The notion of complexity of a linear 
system is derived from work by Green and Tao [GT08] (See Section 3.2 for the formal definition.) 
There, we also show that any matrix of rank at most two is of complexity 1, and, hence, Theorem 3 
is obviously a corollary of this stronger result. But for the sake of simplicity, let us restrict ourselves 
to discussing matrices of rank one in this section. 

Let us compare this result to some previous works. One work that initiated some of the recent 
results on testing Boolean functions was obtained by Green [Gre05]. His result can be formulated 
as saying that for any rank one matrix M, the property of being (M, l fc )-free can be tested with 
one-sided error. Green conjectured that the same result holds for any system of linear equations. 
This conjecture was recently confirmed by Shapira [Sha09] and Krai', Serra and Vena [KSV08]. 

3 If H is a graph on h vertices, then we say that a graph G is induced ff-free if G contains no set of h vertices that 
contain a copy of H and no other edges. 
4 In the sense of Definition 1 
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In our language, the results of [Sha09, KSV08] can be stated as saying that for any matrix M, 
the property of being (M, l fc )-free is testable with one-sided error. The case of arbitrary a was 
first explicitly considered in [BCSX09] where it was shown that if M is a rank one matrix, then 
(M, <r)-freeness is equivalent to a finite set of properties, all of which were already known to be 
testable. Tim Austin (see [Sha09]) conjectured that the result of [Sha09] for an arbitrary matrix M 
can be extended to show testability of (M, cr)-freeness for every vector a. Shapira [Sha09] further 
conjectured that his result can be extended to the case when we forbid an infinite set of systems 
of linear equations as in Definition 2. So Theorem 3 partially resolves the above conjecture, since 
it can handle an infinite number of induced equations (but not an infinite number of forbidden 
arbitrary systems of equations). 

Another way to think of Theorem 3 comes (yet again) from the analogy with graph properties. 
Alon and Shapira [AS08a] have shown that for every set of graphs T , the property of being induced 
J 7 - free is testable with one-sided error. Since in many ways 5 , copies of a fixed graph H in a graph 
G correspond to finding solutions of a single equation in a set S Q , Theorem 3 can be considered 
to be a Boolean functions analog of the result of [AS08a]. Just like the graph property of being free 
of a particular subgraph H is analogous to being (M, a)-free where M has rank 1, the hypergraph 
property of being free of a particular sub-hypergraph H is analogous to being (M, <r)-free for an 
arbitrary M. Now, the result of [AS08a] has been later extended to hypergraphs by Austin and 
Tao [AT08] and Rodl and Schacht [RS09]; so, it is natural to expect that one could also handle an 
infinite number of forbidden induced systems of equations in the functional case as well. All the 
above motivates us to raise the following conjecture. 

Conjecture 4 For every (possibly infinite) set of systems of induced equations J-, the property of 
being T-free is testable with one-sided error. 

As the reader can easily convince himself, a graph property V is equivalent to being induced 
H-fcee if and only if V is closed under vertex removal. Such properties are usually called hereditary. 
This motivates us to define the following analogous notion for properties of Boolean functions. 

Definition 5 (Subspace-Hereditary Properties) A linear-invariant property V is said to be 
subspace-hereditary if it is closed under restriction to subspaces. That is, if f is in V n and H 
is a m- dimensional linear subspace o/FJ;, then f\u € V m also, where® f\u : — > {0,1} is the 
restriction of f to H . 

When considering linear-invariant properties, one can also obtain the following (slightly cleaner) 
view of the properties of Definition 2. This equivalence is analogous to the graph properties men- 
tioned above. We stress that this equivalence is a further indication of the "naturalness" of the 
notion of linear-invariance and its resemblance to the closure of graph properties under vertex 
renaming. We defer its proof to the appendix. 

Proposition 6 A linear-invariant property V is subspace-hereditary if and only if there is a (pos- 
sibly infinite) set of systems of induced equations J- such that V is equivalent to being T-free. 

5 This analogy is informal, but see [KSV09] and [SzelO] for some formal connections. 

6 Note that we are implicitly composing f\u with a linear transformation so that it is now defined on F™. Here, 
we are using the fact that T is linear-invariant. 
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We mention that while the notions of graph properties being hereditary and functions being 
subspace-hereditary are somewhat more natural than the equivalent notions of being free of induced 
subgraphs and equations respectively, it is actually easier to think about these properties using the 
latter notion when proving theorems about them. This was the case in [AS08a], and it will be the 
case in the present paper as well. Proposition 6 along with Conjecture 4 implies the following: 

Corollary 7 If Conjecture 4 holds, then every linear-invariant subspace-hereditary property is 
testable with one-sided tester. 

Observe that if Conjecture 4 holds, then Corollary 7 would give yet another surprising similarity 
between linear- invariant properties of boolean functions and graph properties, since it is known 
[AS08a] that every hereditary graph property is testable. Actually, as we discuss in the next 
subsection, if Conjecture 4 holds, then an even stronger similarity would follow. 

Many interesting properties of the hypercube that have been studied for testability are linear- 
invariant. Important examples include linearity [BLR93], being a polynomial of low degree [AKK+05], 
and low Fourier dimensionality and sparsity [GOS + 09]. These properties have all been shown to 
be testable. Moreover, they all turn out to be subspace-hereditary. Thus, if our Conjecture 4 is 
true, as we strongly believe, then we could explain the testability of all these properties through a 
unified perspective that uses no features of these properties other than their linear invariance. Note 
that our main result, Theorem 3, already shows (yet again!) that linearity is testable but from a 
completely different viewpoint than used in previous analysis. Furthermore, to show the testability 
of low degree polynomials (a.k.a., Reed-Muller codes), we would only need to resolve Conjecture 4 
for a finite 7 family of forbidden induced systems of equations. 

1.4 The proposed characterization of testable linear- invariant properties 

We now turn to discuss our second result, which based on Conjecture 4 gives a characterization of 
the linear-invariant properties of Boolean functions that can be tested with one-sided error using 
"natural" testing algorithms. Let us start with formally defining the types of "natural" testers we 
consider here. 

Definition 8 (Oblivious Tester) An oblivious tester for a property V = {V n } n is a (possibly 2- 
sided error) non-adaptive, probabilistic algorithm, which, given a distance parameter e, and oracle 
access to an input function f : Fg — > {0, 1}, performs the following steps: 

1. Computes an integer d = d(e). If d(e) > n, let H = FJj. Otherwise, let H < F2 be a subspace 
of dimension d(e) chosen uniformly at random. 

2. Queries f on all elements x £ H . 

3. Accepts or rejects based only on the outcomes of the received answers, the value of e, and its 
internal randomness. 

We now discuss the motivation for considering the above type of algorithms. The fact that 
the tester is non-adaptive and queries a random linear subspace is without loss of generality (see 
Proposition 33); this is analogous to the fact [AFKS00, GT03] that one can assume a graph property 

7 The characterization of polynomials of degree d using forbidden induced equations is shown in Appendix A. 
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tester makes its decision only by inspecting a randomly chosen induced subgraph. The only essential 
restriction we place on oblivious testers is that their behavior cannot depend on the value of n, the 
domain size of the input function. If we allow the testing algorithm to make its decisions based on 
n, then it can do very strange and unnatural things. For example, we can now consider properties 
that depend on the parity of n. As was shown in [AS08b], the algorithm can use the size of the 
input in order to compute the optimal query complexity. All these abnormalities will not allow 
us to give any meaningful characterization. As observed in [AS08a] by restricting the algorithm 
to make its decisions while not considering the size of the input, we can still test any (natural) 
property while at the same time avoid annoying technicalities. We finally note that all the testing 
algorithms for testable properties of Boolean functions in prior works were indeed oblivious, and 
that furthermore many of them implicitly consider only oblivious testers. In particular, these types 
of testers were considered in [SudlO]. 

As it turns out, oblivious testers can potentially 8 test properties which are slightly more general 
than subspace-hereditary properties. These are defined as follows. 

Definition 9 (Semi Subspace-Hereditary Property) A property V = {V n } n is semi subspace- 
hereditary if there exists a subspace-hereditary property % such that 

1. Any function f satisfying V also satisfies H. 

2. There exists a function M : (0, 1) — > N such that for every e £ (0, 1), if f : — > {0, 1} is 
e-far from satisfying V and n > M(e), then f\y does not satisfy Ti. 

The intuition behind the above definition is that a semi subspace-hereditary property can only 
deviate from being "truly" subspace-hereditary on functions over a finite domain, where the finite- 
ness is controlled by the function M in the definition. Our next theorem connects the notion of 
oblivious testing and semi subspace-hereditary properties. Assuming Conjecture 4, it essentially 
characterizes the linear-invariant properties that are testable with one-sided error, thus resolving 
Sudan's problem raised in [SudlO] . 

Theorem 10 If Conjecture 4 holds, then a linear-invariant property V is testable by a one-sided 
error oblivious tester if and only if V is semi subspace-hereditary. 

Getting back to the similarity to graph properties, we note that [AS08a] obtained a similar 
characterization for the graph properties that are testable with one-sided error. Let us close by 
mentioning two points. The first is that most linear-invariant properties are known to be testable 
with one-sided error, and hence the question of characterizing these properties is well motivated. 
In fact, for the subclass of linear-invariant properties which also themselves form a linear subspace, 
[BHR05] showed that the optimal tester is always one-sided and non-adaptive. Our second point 
is that it is natural to ask if there are linear-invariant properties which are not testable. A linear- 
invariant property with query complexity Q(2 n ) arises implicitly from the arguments of [GGR98]; 
see Section 5 for a brief sketch. A second, more natural, example comes from Reed-Muller codes. 
[BKS + 09] shows that for any 1 <C q(n) <C n the linear-invariant property of being a log2(<?(ra))- 
Reed-Muller code cannot be tested with o(q(n)) queries. We also conjecture that the property 
of two functions being isomorphic upto linear transformations of the variables is not a testable 

8 The potential relies on the validity of Conjecture 4. 
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property. Lower bounds for isomorphism testing have been studied both in the Boolean function 
model [FKR + 04, BO10] and in the dense graph model [Fis05], but our problem specifically does 
not seem to have been examined in a property testing setting. 

1.5 Paper overview 

The rest of the paper is organized as follows. In Section 2, we discuss the regularity lemma of Green 
[Gre05]. Just as the graph regularity lemma of Szemeredi [Sze78] guarantees that every graph can be 
partitioned into a bounded number of pseudorandom graphs, Green's regularity lemma guarantees a 
similar partition for Boolean functions. This lemma, whose proof relies on Fourier analysis over 
was used in [Gre05] to show that properties defined by forbidding a single (non-induced) equation 
are testable. This basic approach falls short of being able to handle an infinite number of forbidden 
non-induced equations or even a single forbidden induced equation. We thus need to develop a 
variant of Green's regularity lemma that is strong enough to allow such applications. This new 
variant is described in Section 2. The overall approach is motivated by that taken by Alon et al. 
[AFNS06] in their formulation of the functional graph regularity lemma. However, the proof here 
is somewhat more involved since we need to develop several tools in order to make the approach 
work. One of them is a certain Ramsey type result for F?? which is key to our proof and that may be 
useful in other settings (see Theorem 19). The approach of [AFNS06] only allows one to handle a 
finite number of forbidden subgraphs, which translates in our setting to being able to handle a finite 
number of forbidden equations. So, one last technique we employ is motivated by the ideas from 
[AS08a] on how to handle an infinite number of forbidden subgraphs. This (somewhat complicated) 
technique is described in Section 3. We believe that these set of ideas will prove to be instrumental 
in resolving Conjecture 4. Section 5 is devoted to some concluding remarks and open problems. 

2 Pseudorandom Partitions of the Hypercube 

The support of a Boolean function / refers to the subset of the domain on which / evaluates to 

1. If H is a subspace of FJ? and given function / : H — > {0, 1}, let p(f), the density of /, denote 
y* ft x \ 

— . Recall that the Fourier coefficients of /, defined for each a G H*, are: 

fix) ■ (-l) {x ' a) ~ 

For a parameter e G (0, 1), we say / is e-uniform if max^^o |/(a)| < e. This definition captures the 
notion of correlation with a linear function on H, and it will serve as our definition of pseudoran- 
domness. 

Given a function / : — > {0, 1}, a subspace H < Fg and an element g G Fg , define the function 
f^ 9 : H —¥ {0, 1} to be f^j 9 {x) = f{x+g) for x G H. The support of f^ 9 represents the intersection 
of the support of / with the coset g + H. The following lemma shows that if a uniform function 
is restricted to a coset of a subspace of low codimension, then the restriction does not become too 
non-uniform and its density stays roughly the same. 

Lemma 11 Let f : — > {0, 1} be an e-uniform function of density p, and let H < F?> be a 
subspace of codimension k. Then for any c G F^, the function f^ c : H — > {0, 1} is (2 k : e) -uniform 
and of density p c satisfying \p c — p\ < 2 k e. 



/(«)= E 
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Proof: Let H 1 - = {a G F?>| (a,h) = V7i € -ff} be the dual to the vector space H, and let 
H' = F2/H be the quotient of H in E^. We wish to show that, for every c € H', the Fourier 
coefficients of are small. 

For every /3 € F^/if- 1 and a £ if- 1 : 

/(/3 + a) = E [/(x) X/3+Q (*)] = E E f+ c \h) Xna (c' + h)= E X p +a (c') E f+ c ' (h) X p(h) 

X(z.\F r n> C H > 



c'eH' heH 



c'ew 



2 k 



c'eH' 



Recall that Y. a &H± Xa(c') = < ' Fixing /3 G F^/if- 1 and c £ H' and summing up the 

I 1, if c =0. 

quantity computed above over all a € H^~ : we obtain 



2 k [ ^ X/3 +a (c)/(/3 + a) 



E ^ X/3 +Q (c + c')/+ c '(/5) 

E x/3 +a (o)/+ c (/3) + E E + c ')^ c '(/?) 

2 fc 7^ c (/3) + E E Wc')fi c ' +C (/3) 

c'6ff'-{0} aeff 1 

2 fc /^ c (/3) + E Mc) I E ^ c ') J Jh ,+c ^) 

c'6ff'-{0} / 

2 fc /+ c (/3). 



Furthermore, 



/F(0) 



E X/3+a( c )/(/5 + «) 



< E \x^a(c)f(l3 + a) 



a£H A 



lea 1 



\f(P + <x) 



Since / is e-uniform, setting j3 = in the above inequality shows that \p c —p\ < So^aG-H"- 1 - < 
2 k e. For nonzero /3 in E^/H^, it follows again from e- uniformity that |/^ c (/3)| < 2 k e. ■ 

For a subspace H < the H-based partition refers to the partitioning of F2 into the cosets 
in E'2/H. If H' < H, then the if'-based partition is called a refinement of the if-based partition. 
The order of the if-based partition is defined to be [G : H], i.e., the index of if as a subgroup or 
the dimension of the quotient space E^/H. Using this notation, Green's regularity lemma can be 
stated as follows. 



Lemma 12 (Green's Regularity Lemma [Gre05]) For every m and e > 0, there exists T = 
T^(to, e) such that the following is true. Given function f : F2 — > {0, 1} with n > T and H-based 
partition of F^ with order at most m, there exists a refined H' -based partition of order k, with 
m < k < T , for which f^f is not e-uniform for at most e2 n many g G 
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Our main tool in this work is a functional variant of Green's regularity lemma, in which the 
uniformity parameter e is not a constant but rather an arbitrary function of the order of the 
partition. It is quite analogous to a similar lemma, first proved in [AFKSOO], in the graph property 
testing setting. The recent work [GT10] shows a (very strong) functional regularity lemma in the 
arithmetic setting but it applies over the integers and not F2. 

Lemma 13 (Functional regularity lemma) For integer m and function £ : TL^ — > (0, 1), there 
exists T = Tjg(m,£) such that the following is true. Given function f : — > {0, 1} with n > T, 
there exist subspaces H' < H that satisfy: 



Order of H -based partition is k > m, and order of H' -based partition is I <T. 
There are at most £(0) ■ 2 n many g € F?? such that f~^ 9 is not £(0)-uniform. 

For every g £ F?j, there are at most £(k) ■ 2 n ~ k many h € H such that f^f +h is not £{k)- 
uniform. 

• There are at most £ (0) • 2" many g € F?> for which there are more than £(0) ■ 2 n ~ k many 
heH such that \p(f£ 9 ) - p(fx? +h )\ > 5(0). 

Proof: Let us first give an informal overview of the proof. The basic idea is to repeatedly apply 
Lemma 12, at each step refining the partition obtained in the previous step. At each step, Lemma 
12 is applied with a uniformity parameter that depends on the order of the partition obtained in 
the previous step. We stop when the index of the partitions stop increasing substantially. Given a 
subspace H, the index of the H -based partition is defined to be the variance of the densities in the 
cosets: 

We show that when the indexes of two successive partitions are close, then on average, each coset 
of the finer partitioning has roughly the same density as the coset of the coarser partitioning it is 
contained in. 

To implement the above ideas, we need the following two claims about the index of partitions. 
Their proofs are essentially identical to those for the corresponding Lemmas 3.6 and 3.7 respectively 
in [AFKSOO], and so we are a bit brief in the following. 

Claim 14 Given subspace H < and function f : FJ? — > {0,1}, suppose that there are at least 
e2 n many g G F™ such that \p(f) - /)(/^ 9 )| > e. Then: 



ind(/, J ff)>p 2 (/) + ^ 



Proof: Observe that the average of pif^ 9 ) over all g £ F?? equals p(f). From our assumptions, 
either there are |2 n many g € F2 such that p(f) — p{f^ 9 ) > e or there are |2™ many g € F2 
such that p(f) — p(ftf 9 ) < — e. For either case, we can use the defect form of the Cauchy-Schwarz 
inequality to prove our claim. ■ 
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Claim 15 For function f : F?j — > {0, 1} and subspaces H' < H < suppose the H -based partition 
of order k and its refinement, the H' -based partition, of order i satisfy ind (/, H') — ind(/, H) < y 
for some e. Then, there are at most e2 n many g € Fj for which there are more than e2 n ~ k many 
h£H satisfying \p(f+ g ) - p(f+? +h )\ > e. 

Proof: Suppose that there are > e2 n many g € F?> such that there are > e2 n ~ k many h E H 
satisfying \p{f^ g ) — p{f^ +h )\ > e. Use Claim 14 to obtain a contradiction: 

ind(/,^) = ^ E p 2 (fP) = ^ E^E p 2 (fP +h ) 

= y* E ind (/^) 

v£F%/H 

>pf E + 

= ind(/,fT) + j 

■ 

Now we have the pieces needed to prove the lemma. We can assume £(■) is monotone non- 
increasing. Let e = f(0). We define T inductively as follows. Let = Ti 2(771, e), and for i > 1, 
let: 

T«=T 12 (V-VfV- 1 )) •2^- 1) ) 

Setr = r 13 (m,^ f r( 2 ^ 4+1 ). 

We now show that this choice of T suffices. Given function / : FJJ — >■ {0,1}, apply Lemma 
12 with to and e to get a subspace Hi, and thereafter repeatedly apply it to get a sequence of 
finer subspaces Hi, H^,H^, . . . , with Hi > H2 > H3 > #4 > by invoking Lemma 12 at 
each step i > 1 with T^" 1 ) and £ (T^" 1 - 1 ) • 2~ T ^ x) as the two input parameters. Stop when 
ind(/, Hi + i) — \nd(f,Hi) < This happens when i is at most 2e -4 + 1 because the index of any 
partition is less than 1. Let H = Hi and H' = Hi+\. It's clear that the codimension k of H at least 
to and that the codimension I of H' is at most T. The second item in the lemma follows from the 
uniformity guarantee of Lemma 12 and from the fact that £{T^ l ~ 1 ^) < £ (0). For the third, note 
that Lemma 12 guarantees that there are at most £{k)2~ k 2 n = 8(k)2 n ~ k values of g G F£ such that 
f H f is not {£ (/c)2~ fc )-uniform and, hence, not £(fc)-uniform. So, clearly, there are at most so many 
g contained in any coset of H. Finally, the fourth item follows from Claim 15. This completes the 
proof of Lemma 13. ■ 

We use Lemma 13 in two main ways. For one of them, we use the lemma directly. For the 
other, we use the following simple but extremely useful corollary which allows us to say that there 
are many cosets in a partitioning which, on the one hand, are all uniform, and on the other hand, 
are arranged in an algebraically nice structure. 

Corollary 16 For every m and £ : Z + — > (0,1), there exist T = T-^Q{m,£) and 5 = 5fQ(m,£) 
such that the following is true. Given function f : F?? — > {0, 1} with n >T , there exist subspaces 
H' < H < F2 and an injective linear map I -.Y^/H — > W^/H' such that: 
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• The H-based partition is of order k, where m < k < T. Additionally, \H'\ > 52 n . 

• For each u G W^/H, I(u) + H' lies inside the coset u + H . Note that 1(0) = since I is 
linear. 

• For every nonzero u G W^/H, the set f^f is £ (k) -uniform. 

• There are at most £ (0)2 n many g G F'2 for which \p(f^j 9 ) — p(f%' )\ > ^(0) where u = g 
(mod F). 

Proof: We can assume £ is a nonincreasing function. Denote £(0) as e, and set £'(r) = min(£(r), |, 

We will show that T = T^(m : £) A =T^(m,£') and 5 = 5^g(m, £) d =l/2 T suffice for our proof. 

Apply Theorem 13 with m and the function £' as inputs. Let H and H' be the subspaces 
obtained there, for the given / : FJ? — > {0, 1}. We find / satisfying the conditions of the claim exists 
using the probabilistic method. 

Fix k linearly independent elements u\, . . . , £ F^/i? (viewing V^/H as a vector space over 
F2). For every i G [k], choose independently and uniformly at random an element v from H / H' 
and let I(ui) equal itj + v + H'. The value of / over the rest of F^/i? is determined by linearity, as 
the Kj's form a basis for F^/H. It's immediate that I(u) + H' lies inside u + H for every u G W^/H. 

Observe that unless u = 0, each J(n) is uniformly distributed among the cosets of H' lying 
in u + H. Hence, for any nonzero u, the probability that f^f^ is not £"(A;)-uniform is at most 
l/2 fc+1 , by our choice of parameters. Applying the union bound, the probability that there exists 

nonzero u G F??/ H such that f^f is not £'(/c)-uniform is at most 1/2. Also, the expected number 

of g G F£, with u = g (mod H), for which |p(/^ 9 ) - y o(/^, /(u) )| > e is at most § 2 n + f 2 n + 1 < f 2 n , 
and hence by the Markov inequality, with probability at least ^, the number of g G FJ? satisfying 
this condition is at most e2 n . Therefore, there must exist a choice of I making both the third and 
fourth claims true. ■ 

The next lemma is in a similar spirit to Corollary 16. It also obtains a set of uniform cosets which 
are structured algebraically, but in this case, all of them are contained inside the same subspace. 

Lemma 17 For every positive integer d and 7 G (0,1), there exists 5 = S^/y^d, 7) such that the 
following is true. Given f : F2 — > {0, 1}, there exists a subspace H < Fg and a subspace K of 
dimension d in the quotient space F^/i? with the following properties: 

• \H\ > 52 n . 

• For every nonzero u G K , f^ u is ^-uniform. 

• Either p{f^ u ) > \ for every nonzero u G K or p(f^ u ) < \ for every nonzero u G K . 

We need a different set of tools to prove this lemma. Specifically, we use linear algebraic 
variants of the classic theorems of Turan and Ramsey. We note that the (classic) Turan and 
Ramsey Theorems are key tools in many applications of the graph regularity lemma, for example 
in the well known bound on the Ramsey numbers of bounded degree graphs [CRSW83]. Hence, 
the variants that we use of these classic results may be useful in other applications of Greens's 
regularity lemma. 
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Proposition 18 (Turan theorem for subspaces) For positive integers n, if S is a subset of 
with density greater than 1 — t^Lt, then there exists a subspace H <¥% of dimension d such that 
H — {0} is contained in S. Moreover, there is a subset ofF^ with density (l — t^t) which does 
not contain H — {0} for any subspace H < FJ;. 

Proof: Let S C F?> be a maximal set that does not contain H — {0} for any d-dimensional subspace 
H . Since S is maximal, it must contain K — {0} for some (d — l)-dimensional subspace K (if not, 
we can simply add it to S without introducing points of H — {0} for any d-dimensional subspace 
H). Let K' be an (n — d + l)-dimension subspace that intersects K only at {0}. 

Now, observe that for any nonzero a G K' , at least one of the elements of {a + k : k E K} 
must not belong to S. Otherwise, S would contain (K — {0}) U {a + k : k E K} = H — {0} for 
a (i-dimensional subpace H = span(J^ U {a}), contradicting our assumption for S. Thus, we can 
upper-bound the number of points in S by: 

\S\ < \K' - {0}| • (\K\ -1) + \K- {0}| = (2 n ~ d+1 - 1) • (2 d - 1 - 1) + (2 d ~ 1 - 1) = 2 n - 2 n ~ d+1 

To see that the above bound is tight, let S = — K' for any (d — l)-dimensional subspace 
K < ¥2 and K' as above. It is easy to check that this S does not contain H — {0} for any H <¥^ 
with dim(iT) = d. ■ 



Theorem 19 (Ramsey theorem for subspaces) 9 For every positive integer d, there exists 
N = N}g(d) such that for any subset S C F^, there exists a subspace H < F^ of dimension 
d such that H — {0} is contained either in S or in S. 

Proof: We will show a stronger statement, which we describe in the following lemma. 

Lemma 20 For every positive integer d\,d2, there exists N(di,d<2,) such that for any subset S C 
jpAT(<2i,d 2 \ either there exists a subspace FL\ < jp^ rfl,d2 ^ y dimension d\ such that FL\ — {0} is 
contained in S or there exists a subspace H2 < F^ dl,d2 ^ of dimension d<i such that H2 — {0} is 
contained in S. 

One can immediately deduce the statement of the theorem by taking d = d\ = d^ in Lemma 20. 
To prove Lemma 20 we first prove the following helpful result. For a subspace H < Fg we say that 
an affine subspace a + H is strict if a G ¥2/ H — {0}. 

Lemma 21 For every positive integer d, there exists N a = N a (d) such that for any subset S C F^ a , 
there exists a strict affine subspace A < F^ a of dimension d such that A is contained either in S 
or in S. 

Proof: Notice that iV a (l) = 1. Assume, by induction that the lemma holds for dimension d — 1, 
and let N a (d) = 2 N ^ d -^ +1 + N a {d - 1). Let S C F^ a(d) be an arbitrary set, let H = Ff a(d_1) , 
and H 1 = F^ a((i) /H. Notice that \H'\ = 2 2JVa(d " 1)+1 . For each c G H' - {0} consider the set 
fff C C H. Since there are 2 2Na(d — 1 possible such sets, and each set has size at most 2 N °-( d ~ 1 " > 

9 'As pointed to us recently by Noga Alon, this theorem might be implied by the Folkman-Rado-Sanders Theorem, 
but we include a self-contained proof for the sake of completeness. 
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it follows that there exists c\ 7^ c 2 £ H' — {0} such that f^ Cl = f^ C2 - By the induction hypothesis, 
either /# C1 or its complement contains a d — 1 dimensional affine subspace. Assume w.l.o.g. that 
fx 01 contains an affine subspace a + fd-\ of dimension d — 1 (otherwise replace S by 5), for some 
a £ H — fd-i- Then the affine subspaces a + c\ + /^-l and a + C2 + fd-i are both contained 
in S. Let A^ = (a + c\ + /d-i) U (a + C2 + fd-i) C 5. To conclude the proof, notice that 
A^ = a + c\ + span(c2 — ci,/d_i) is a strict affine subspace of dimension d, since a / q and 
C2 - ci ■ 



Proof of Lemma 20: The proof follows by induction on <ii and (f2, with the base cases iV(0, 1) = 
iV(l,0 = 1. Assume that there exists N(d\ — l,d 2 ) and N(d\,d 2 — 1) satisfying the conditions of 
the lemma. Define 

N(d 1 ,d 2 ) = N a (max(N(dt - 1, d 2 ), N(d u d 2 - 1))), 

where N a (d) is the quantity defined in Lemma 21. We show that for any arbitrary set S C f^ r ( dl ' d2 ) 
either it contains a subspace of dimension d\ (except 0) or its complement contains a subspace of 
dimension d 2 (except 0). Suppose N(d\ — l,d 2 ) > N{d\,d 2 — 1). By definition and by Lemma 21, 
there exists a strict affine subspace A C jr^( rfl ' rf2 ) suc h that A = a + HQSovAQS (where H is 
the subspace underlining A). Assume for now that the former holds. Since 

the induction hypothesis, either H n S contains a subspace of dimension d\ — 1 or H — S contains 
a subspace of dimension d 2 , in which case we are done. If H D S contains a subspace fd x -i — {0} of 
dimension d\ — 1, then define fd x = /<2i-i U a + /di-i = span(a, fd 1 ~i)- Clearly G S 1 and it has 
dimension d\, which completes the proof of this case. It remains to deal with the case when ACS. 
Since N(d\ — l,d 2 ) > N(d\,d 2 — 1), there exists another affine subspace A' = a' + H' C A C S 
of dimension N(d±,d 2 — 1). Again, by the induction hypothesis, the set H' PI S either contains 
a subspace of dimension d±, in which case we are done, or H' — S contains a subspace fd 2 -i of 
dimension c?2 — 1. In the latter case define fd 2 = fd 2 -i Ua' + /d 2 -i — span(a', fd 2 ~\)- Finally, notice 
that fd 2 G S and it has dimension d 2 . ■ 

This concludes the proof of Theorem 19M 

Given these results, Lemma 17 follows fairly readily. 

Proof of Lemma 17: Set 5 = 5 17 (d, 7) d =2" T 12 (r ' min(2 " r " 2 ' 7)) with r = N ig (d). Given / : 
— > {0, 1}, apply Lemma 12 with inputs r and min(2 _r_2 ,7) to obtain a subspace H such that 
restrictions of S to at most 2~ r ~ 2 fraction of the cosets of the fZ-based partition are not 7-uniform. 
Using Proposition 18, there exists a subspace L < V^/H of dimension r such that for every nonzero 
u £ L, the set is 7-uniform. Furthermore, since L is of dimension Nig(d), by Theorem 19, 
there exists a subspace K < L < F?> /i/ satisfying the final condition of the lemma. ■ 



3 Forbidding Infinitely Many Induced Equations 

In this section, we prove our main result (Theorem 3) that properties characterized by infinitely 
many forbidden induced equations are testable. To begin, let us fix some notation. Given a matrix 
M over F2 of size m-by-k, a string a £ {0, l} fc , and a function / : FJ? -4 {0,1}, if there exists 
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x = (x±, . . . ,Xk) £ (F^)* such that Mi = and f(xi) = o~i for all i € [fc], we say that / induces 
(M, a) at x and denote this by (M, o) \-t f. 

The following theorem is the core of the proof of Theorem 3. 

Theorem 22 For every infinite family of equations T = {(E 1 , a 1 ), (E 2 , a 2 ), . . . , (E l ,a l ), . . . } with 
each E i being a row vector [11 • • • 1] of size h L and a 1 € {0, l} k * a hi-tuple, there are functions 
Njr{-), kjr{-) and8jr(-) such that the following is true for any e € (0,1). If a function f : Fg —> {0,1} 
with n > Njr(e) is (.-far from being J- -free, then f induces 5 ■ 2 n ( fci_1 ) many copies of some (E % , a 1 ), 
where ki < kjr(e) and 5 > <5jr(e). 

Armed with Theorem 22, our main theorem becomes a straightforward consequence. We post- 
pone the proof of this, because we will prove a stronger fact in Section 3.2. To start the proof of 
Theorem 22, let us relate pseudorandomness (uniformity) of a function to the number of solutions 
to a single equation induced by it. Similar and more general statements have been shown previously, 
but we need only the following claim for what follows. 

Lemma 23 (Counting Lemma) For every r/ £ (0, 1) and integer k > 2, there exist 7 = 7^3(77, k) 
and 5 = 623(1], k) such that the following is true. Suppose E is the row vector [1 1 • • • 1] of size 
k, a 6 {0, l} k is a tuple, H is a subspace of Fg, and f : Fg — > {0, 1} is a function. Furthermore, 
suppose there are k not necessarily distinct elements u\,...,Uk £ such that Mu = where 

u = (u\, . . . , Uk), ftf Ur ■ H — > {0, 1} is ^-uniform for all i £ [k], and p(f^ Ul ) is at least rj if o~(i) = 1 
and at most 1 — rj if o~(i) = for all i € [A;]. Then, there are at least o~|-ff| fc_1 many k-tuples 
x = (x\,X2, ■ ■ ■ , Xk), with each X{ € U{ + H , such that f induces (E, a) at x. 

Proof: Fix vy € U\ + H, V2 € U2 + H, . . . , v% € Uk + H such that V\ + v-i + • • • + Vk = 0; there 
exist such Uj's because u± + U2 + • • • + Uk = in the quotient space WQ/H. Define Boolean functions 
/i, . . . ,/ fc : # -> {0, 1} so that fi{x) = f£\x) if a(i) = 1 and / 4 (x) = 1 - f+ Vi (x) if a(i) = 0. 
By our assumptions, /i(0) > rj and each |/j(a)| < 7 for all a 7^ 0. Now, observe that, using 
7-uniformity and Cauchy-Schwarz, we have: 

E rr [fi{xi)f 2 (x2) ■ ■ ■ fk-l(xk-i)fk(xi +x 2 -\ h x k -i)] 

x 1 ,...,x k _ 1 eH 

= ^2 M a )h( a ) ■ ■ ■ fk(a) 

a&H* 

>^-^|/i(a)/ 2 (a).../ fc (a)| 

> T] k — 7 fe ~ 2 

Setting 7 = 723(7/5 k) d = (r/ k /2) 1 /( k ~ 2 ' > makes the above expectation at least ?y fc /2. Now note that 
every x\, . . . ,x k £ H such that x\ + ■ ■ ■ + Xk = gives y = (yi, . . . , y^), where y« = u« + Xj for all 
i € [fc], such that / induces (E,a) at y. Thus, we have from above that there are at least <5 1 | fc 1 

many such y's, where 5 = ^23(7?, k) = r] k /2. ■ 
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3.1 Proof of Theorem 22 



Before seeing the full technical details of the proof of Theorem 22 we proceed with a more intuitive 
overview. 

In light of Lemma 23, our strategy will be to partition the domain into uniform cosets, using 
Green's regularity lemma (Lemma 12) in some fashion, and then to use the above counting lemma 
to count the number of induced solutions to some equation in T . But one issue that immediately 
arises is that, because T is an infinite family of equations, we do not know the size of the equation 
we would want the input function to induce. Since Lemma 23 needs different uniformity parameters 
to count equations of different lengths, it is not a priori clear how to set the uniformity parameter 
in applying the regularity lemma. (If T was finite, one could set the uniformity parameter to 
correspond to the size of the largest equation in J-.) 

To handle the infinite case, our basic approach will be to classify the input function into one 
of a finite set of classes. For each such class c, there will be an associated number k c such that 
it is guaranteed that any function classified as c must induce an equation in T of size at most 
k c . If there is such a classification scheme, then we know that any input function must induce an 
equation of size at most max c k c . How do we perform this classification? We use the regularity 
lemma. Consider the following idealized situation. Fix an integer r. Suppose we could modify the 
input / : FJ? — >• {0, 1} at a small fraction of the domain to get a function F : FJ? — >• {0, 1} and then 
could apply Lemma 12 to get a partition of order r so that the restrictions of F to each coset was 
exactly 0-uniform. F is then a constant function (either or 1) on each of the 2 r cosets, and so, 
we can classify F by a Boolean function fj, : F2 — > {0, 1} where fi(x) is the value of F on the coset 
corresponding to x. Notice that there are only finitely many such /i's. Since F differs from / at only 
a small fraction of the domain and since / is far from J-~-free, F must also induce some equation in 
T . Then, for every such \x and corresponding F, there is a smallest equation in T that is induced 
by F . We can let ^j-(r) be the maximum over all such \i of the size of the smallest equation in 
F that is induced by the F corresponding to fj,. We then might hope that this function ^j-(-) can 
be used to tune the uniformity parameter by using the functional variant of the regularity lemma 
(Lemma 13). 

There are a couple of caveats. First, we will not be able to get the restrictions to every coset 
to look perfectly uniform. Second, if F induces solutions to an equation, it does not necessarily 
follow that / also does. To get around the first problem, we use the fact that Lemma 23 is not very 
restrictive on the density conditions. We think of the uniform cosets which have density neither too 
close to nor 1 as "wildcard" cosets at which both the restriction of / and its complement behave 
pseudorandomly and have non-negligible density. Thus, the \i in the above paragraph will map into 
{0, 1, *} r , where a '*' denotes a wildcard coset. For the second problem, note that it is not really a 
problem if .F-freeness is known to be monotone. In this case, F inducing an equation automatically 
means / also induces an equation, if we obtained F by removing elements from the support of /. 
For induced freeness properties, though, this is not the case. Using ideas from [AFKS00] and the 
tools from Section 2, we structure the modifications from / to F in such a way so as to force / to 
induce solutions of an equation if F induces a solution to the same equation. We elaborate much 
more on this issue during the course of the proof. 

The observations described in the proof sketch above motivate the following definitions. 

Definition 24 Given function [i : Frj — > {0, 1, *}, a m-by-k matrix M and a k-tuple a € {0, l} k , 
suppose there exist x\, . . . , xu € W 2 such that Mx = where x = (x\, . . . , Xf.), and for every i € [k], 
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fi(xi) equals either a (i) or *. In this case, we say /i partially induces (M, a) at x and denote this 
by (M, a) i-s>* /i. 

Definition 25 Given a positive integer r and an infinite family of systems of equations T = 
{(M 1 , a 1 ), (M 2 , a 2 ), . . . } with M l being a rrii-by-ki matrix of rank mi and a 1 G {0, l} kl a ki-tuple, 
define T r to be the set of functions fi : F£ — > {0, 1, *} such that there exists some (M l , a 1 ) G J- with 
(M l , a 1 ) i— fi. Given T and integer r for which T r ^ 0, define the following function: 

^l'jr(r) ^= f max min k{ 

Proof of Theorem 22: Define the function £ by setting £(0) = e/8 and for any r > 0: 

f(r) = %(Mr),72 3 (€/8 J *jr(r)))-nii I i(e/8,723(e/8,*jr(»-))) 
Additionally let T(e) = T 16 (8/e,£), and set N T (ef=T(e). Also, set fcjr(e) d =^jr(T(e)) and 

$r(e)=(*17CMr), 723(^/8, MO)) ■ 5i 6 (8/ £ , £))^ {e) • %(e/8, ^(T( e ))) 

We proceed to show that these parameter settings suffice. 

Suppose we are given input function / : — > {0,1} with n > Njr(e) = T]_g(8/e, £). As 
mentioned in the paragraphs preceding the proof, our strategy will be to partition the domain in 
such a way that we can find cosets in the partition satisfying the conditions of Lemma 23. To 
this end, we apply Corollary 16 with 8/e and the function £ as inputs. This yields subspaces 
H' < H < ¥2 and linear map I : W'^/H — > W^/H', where the order of the i/-based partition, which 
we denote £, satisfies 8/e < £ < T^g(8/e, £). Recall that I{u) + H' is contained in u + H for every 
coset u € W2/H. Observe that from our setting of parameters, we have that for every nonzero 
u G F$/H, the restriction is (d 17 ^f T (£),j 23 (e/8,^ T (£))) ■ 723 ( e /8, ^(£)))-uniform. 

But we have no such uniformity guarantee for f^,. This would not pose an obstacle if J-- 
freeness were a monotone property (i.e., if each a 1 equalled l ki ). If that were the case, we could 
simply make / zero on all elements of H. Since H is still only a small fraction of the domain, 
the modified function would still be far from .F-free, and we would be guaranteed that remaining 
solutions to equations of J- induced by / would only use elements from cosets of H for which we 
have a guarantee about the corresponding coset of H' . But if J-"-freeness is not monotone, such a 
scheme would not work, since it's not clear at all how to change the value of / on H so that any 
solution to an equation from T would only involve elements from nonzero shifts of H. 

To resolve this issue, we further partition H' to find affine subspaces within H' on which we 
can guarantee that the restriction of / is uniform. The idea is that once we know that there is a 
solution involving H, we are going to look not at H' itself but at the smaller affine subspace within 
H' on which / is known to be uniform. Specifically, apply Lemma 17 to f^? with input parameters 
^f(£) and 7 23 (e/8, ^ f{£)). This yields subspaces H" and W, both of which contained in H' , such 
that \H"\ > <5 17 (^(^),72 3 (e/8,^(^)))|^ / | and dim(W/H") = ^ T {£). We further know that for 
every nonzero v G W/H", the function f^, is 7 23 ( e /8; ^^(-^-uniform. 
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Now, let's "copy" W on cosets I(u) + H' for every u G ¥V;/H. We do this by specifying 10 another 
linear map J : F%/H -+ F£ so that for any it G F%/H, the coset 11 J(u) + TV lies inside J(u) + H 
(which itself lies inside u + H). Each coset J(u) + W also has an H"-based partition of order 1 J/jr(£), 
just as W itself does. Consider v G W^/H" such that u + i?" lies inside J(u) + VF for some nonzero 
u G W2/H. Then, because we know the uniformity of f^f and we have a lower bound on the 
size of H", it follows from Lemma 11 that f^, is 723(^/8, ^ r ^(^))-uniform. Thus, for any nonzero 
v G ¥'2/ H" such that v + H" lies inside J(it) + W for some u G W%/H, it is the case that /™ is 
723(e/8, ^ r jr(^))-uniform. 

In the following, we will show how to apply Lemma 23 on some of these cosets ftf, . We have 
already argued their uniformity above. We now need to make sure that the pattern of their densities 
allow Lemma 23 to infer many induced copies of some equation in T. To this end, we modify / to 
construct a new function F : FJ? — > {0, 1}. F is initially identical to / on the entire domain, but is 
then modified in the following order: 

1. For every nonzero u G F%/H such that \p(F£ u ) - p(F^! (u) )\ > e/8, do the following. If 

p(F^f^) > |, then make F(x) = 1 on all x G u + H. Otherwise, make F{x) = on all 
x £ u + H. 

2. For every nonzero u € F^/i? such that p(F^f^) > 1 — e/4, make F(x) = 1 for all x G u + H. 
On the other hand, if u G W^/H is nonzero and p{F+l [u) ) < e/4, make = for all 
x G ii + -ff . 

3. If for all nonzero v G W/H", p^F^,) > ^, then make ^(x) = 1 for all x G H. On the other 
hand, if for all nonzero v G W/H", p(F^?,) < ^, them make F(x) = for all x G ff. (One of 
these two conditions is true by construction.) 

The following observation shows that F also must induce solutions to some equation from J-, 
since F is e-far from being .F-free. 

Claim 26 F is e-close to f. 

Proof: We count the number of elements added or removed at each step of the modification. 
For the first step, Corollary 16 guarantees that at most 5(0) < e/8 fraction of cosets u + H have 
\p(Fff U ) — p(Fflf^ w ')\ > e/8. So, F is modified in at most |2 n locations in the first step. In the 
second step, if 1 > p{F^t ) > 1 — e/4, then p(F^ u ) > 1 — 3e/8 because the first step has been 

completed. Similarly, if < p(F^j^) < e/4, then p(F^ u ) < 3e/8. So, F is modified in at most 
^2™ locations in the second step. As for the third step, H contains at most 2 n_£ < 2 n ~ 8 / e < 1 2 n 
elements for e G (0, 1). So, in all, F is e-close to /. ■ 

Now, we define a function p, : F| — >■ {0, 1, *} based on F and argue that it must partially induce 
solutions to some equation in T . Since H is of codimension £, Fr, /H = F| and we identify the two 
spaces. For u G W^/H, if F(x) = 1 on the entire coset u + H, let p(u) = 1. On the other hand, if 
F(x) = on the entire coset u + H, then let p{u) = 0. In any other case, let p{u) = *. 

10 One way to accomplish this is to define J appropriately for i linearly independent elements of F^/H and then 
use linearity to define it on all olV%/H. 

n Note that the image of J is to elements of F£ and not FJ/W, even though we think of the output as denoting a 
coset of W . The reason is that we will find it convenient to fix the shift and not make it modulo W . 
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Claim 27 There exists (E 1 ,^) G T such that {E\a l ) p,. 



Proof: As already observed, F is not F-free, and let (E l ,a l ) G T be some equation whose 
solution is induced by F at (x±, . . . , x^) G (F^)*'. Now let y = (yi, . . . , y^) G (I2) • where for 
each j G [ki], yj = Xj (mod H). It's clear that F/y = 0. To argue that F partially induces p at y, 
suppose for contradiction that for some j G [ki], p{yj) = but <t* = 1. But if p(yj) = 0, then F is 
the constant function on all of yj + H, contradicting the existence of Xj G yj + H with F(x) = 1. 
We get a similar contradiction if //(j/j) = 1 but a* = 0. ■ 

Using Definition 25, we immediately get that there is some (E l ,o~ l ) G T of size at most Vl/j^) 
such that (E l ,a l ) /i. Fix xi,...,^ G FJ? where F induces (E l ,a l ), and as in the above 
proof, let y±,...,yk. G W^/H where each y~ = Xj (mod H). Also, pick ki — 1 linearly indepen- 
dent elements . . . , v^-i from W/H", which is possible since dim(W/H") = j^(£) > ki — 1, 
and choose t> 1 G v\ + F 7 ', . . . , Ufcj-i G u^-i + F 7 ' such that t>i, . . . , are linearly independent. 
Additionally set = XljLi 1 ^j- Notice that none of v±,... , Vj^ are in H" . Now, consider the sets 

f^jf,^ Vl ^ +Vl , ffff^ V2 ^ +V2 , ■ ■ ■ , f^f / ( yk ^ +Vk * , (Notice these are restrictions of /, not F!) We will show 
that these sets respect the density and uniformity conditions for Lemma 23 to apply. 

As for uniformity, we have already argued that each of these sets is 723^/8, ^j-(£))-uniform, 
since J(yj) + Vj is not in H" for every j G [ki]. For density, we argue as follows. For every j G [ki], 
there are three cases: p(yj) = 1, p(yj) = 0, and p(yj) = *. Consider the first case. If yj + H was 
affected by the first modification from / to F, then, p{ffj!^ Vj ^) > \, and using the £(£)-uniformity of 
f+l (yj) along with Lemma 11, we get that p(/^ fe)+t,J ) > \ -8 {£) • (^(^(r), 7 23 (e/8, *jr(r))) > 
2 — I > § . If yj + H was affected by the second modification, then, by the same argument, we 
get that p(fg?i +v ') > 1 — I — § > §. Else, if yj + F was affected by the third modification 
from 5 to S', we are automatically guaranteed that pif^f, +Vj ) > \ since J"(j/j) + Vj G" F 7 '. The 
case p(yj) = is similar, and the analysis shows that p{f^!^ yj ^ +Vj ) > 1 — f. Finally, consider the 
"wildcard" case, p(yj) = *. This case arises only if yj 7^ and e/4 < p(fjji ) < 1 — e/4. Again 

using £(£)-uniformity of f^!^^ along with Lemma 11, we get that e/8 < p(ffff/ ' ) < 1 — e/8. 

Thus, we can apply Lemma 23 with e/8 and ^t(£) as the parameters to get that there are at 
least S 2 s(e/8,^^(£))\H"\ ki ~ 1 tupl es z = (z±, . . . , Zfy) with each Zj G J{yj)+Vj+H" at which {E l ,<j l ) 
is induced . Finally, each such Z\,... ,z^ leads to a distinct z' = (z[, . . . ,z' k .) G (FJ;)* 1 * at which 
(E 1 ,a l ) is induced by /, by setting each z'- to J(yj) + Vj + Zj and observing that Y^jLi J(lJj) + v j = 
J [Ylj=i Uj^j + Sj=i v j = 0- This completes the proof of Theorem 22. ■ 



3.2 Extending to Systems of Equations of Complexity 1 

As mentioned in the introduction, the result we actually prove is stronger than Theorem 3. To 
describe the full set of properties for which we can show testability, we first need to make the 
following definition. 

Definition 28 (Complexity of linear system [GT08]) An mxk matrix M over ¥2 is said to 
be of (Cauchy-Schwarz) complexity c, if c is the smallest positive integer for which the following is 
true. For every i G [k], there exists a partition of [k]\{i} into c + 1 subsets S±, ■ ■ ■ , S c +i such that 
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for every j € [c + 1], (ej + X^'es^- e^/J g" rowspace(M), where rowspace(M) is £/ie linear subspace 
of¥ 2 spanned by the rows of M. 

In other words, if we view the rowspace of the matrix M as specifying a collection of linear depen- 
dencies on k variables x±, . . . , x^, then M has complexity c if for every variable Xi, the rest of the 
variables x±,..., ■ ■ ■ ,Xk can be partitioned into c+ 1 sets Si,... , S c+ i such that Xj is not 

linearly dependent on the variables of any single Sj. Let us make a few remarks to illustrate the 
definition. Green and Tao show (Lemma 1.6 in [GT08]) that if each of these linear dependencies 
involves more than two variables, then the complexity of M is at most rank(Af) = m. In particular 
then, if M has one row and is nonzero on more than two coordinates, M has complexity 1. This 
is the setting we discussed in the introduction. We slightly extend this observation in the claim 
below. Before we state it, we observe that in the context of property testing, it is only natural to 
exclude matrices which yield linear dependencies involving less than three variables. If the rowspace 
of the matrix M contains a vector which is nonzero at only one coordinate i, then for any string a 
of length k, the property of (M, cr)-freeness must contain all functions / such that /(0) = 1 — Cj, 
and so every function is exponentially close to such a property. Similarly, if rowspace(M) contains 
a vector nonzero only at two coordinates i and j, then for any a € {0, l} fc , either (M, <j)-freeness 
is trivial (if <7j 7^ <7j) or it is equivalent to (M' , o"')-freeness where a' is the string obtained by 
removing coordinate j and M' is the matrix obtained by removing column j, adding 1 (mod 2) to 
every element in column i and row-reducing the resulting matrix. 

Claim 29 // M £ ~§? 2 nxk i s a matrix with two rows such that every vector in its rowspace has at 
least three nonzero coordinates, then M has complexity 1. 

Proof: Let Ri C [k] be the set of coordinates for which the first row is nonzero, and R 2 C [k] 
those for which the second row is nonzero. We can assume that Ri $Z R2 and R2 2 R\, because 
if, say, Ri C R 2 , we could replace the second row by the sum of the first and second, making Ri 
and i?2 disjoint but preserving the rowspace of the matrix. Also, we we can assume w.l.o.g. that 
RiUR 2 = [k]. 

Fix is [k]. We want to show a partition of into sets Si, S 2 such that + Yli'eS! e *' ^ 

rowspace(M) and similarly for S 2 - If i S Ri\R 2 , let Si consist of two elements, one from R 2 \Ri 
and one from i?i\{i}, and let be the rest. If i 6 R 2 \Ri, let Si consist of one element from 
Ri\R 2 and one from R 2 \{i}, and let S2 be the rest. And finally, if i 6 Ri n R 2 , let Si consist of 
one element from Ri\R 2 and one from R 2 \Ri, and let S 2 be the rest. It is straightforward to check 
that the definition of complexity 1 is satisfied by these choices. ■ 

More generally, an infinitely large class of complexity 1 linear systems is generated by graphic 
matroids. We refer the reader to [BCSX09] for definition and details. That this class contains the 
class of matrices proved to be of complexity 1 in Claim 29 is easy to show. We proved the claim 
separately above only to be self-contained without introducing matroid notation. One final remark 
is that if M is the matrix in the characterization of Reed-Muller codes of order d from Appendix 
A, then M has complexity exactly d; see Example 3 of [GT08]. 

Our main result in this section is the extension of Theorem 3 to complexity 1 systems of 
equations. 
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Theorem 30 Let F = {(M 1 , c 1 ), (M 2 , a 2 ), . . . } be a possibly infinite set of induced systems of 
equations, with each M l of complexity 1. Then, the property of being T-free is testable with one- 
sided error. 

We next describe how to modify the previous proof to the new setting. The following analog to 
Theorem 22 is the core of the proof of Theorem 30. 

Theorem 31 For every infinite family F = {(M 1 , a 1 ), (A/ 2 , a 2 ), . . . , (M l , a 1 ), . . .}, where each M % 
is aniiX k^ matrix over ¥2 of complexity 1, there are functions iVj-(-), kjr{-) and 5j^(-) such that the 
following is true for any e € (0, 1). If a function f : Fg — > {0, 1} with n > Njr(e) is e- far from being 
J- -free, then f induces S-2 n ( ki ~ mi ' many copies of some (M l ,a l ), where < kjr(e) and 5 > Sjr(e). 

We show how to deduce Theorem 30 from Theorem 31 next. Note that, as promised earlier, 
this is also a proof of Theorem 3 assuming Theorem 22. 

Proof of Theorem 30: Theorem 31 allows us to devise the following tester T for F-freeness. 
T, given input / : F?j — > {0, 1}, first checks if n < iVjr(e), and in this case, it queries / on the entire 
domain and decides accordingly. Otherwise, T selects independently and uniformly at random a 
set D of d elements from FJ>, where we will specify d at the end of the argument. It then queries 
all points in the linear subspace spanned by the elements of D and then accepts or rejects based 
on whether / restricted to this subspace is J 7 - free or not. 

Clearly, if / is F-free, then the tester always accepts because the property is subspace-hereditary. 
Also, if n < Njr(e), then the correctness of the algorithm is trivial. So, suppose / is e-far from 
F-free and n > Njr(e). For the M l guaranteed to exist from Theorem 31, let K be a ki x c matrix 
over F2, where c = ki — m; < fcj-(e), such that the columns of K form a basis for the kernel 
of M\ Then, every y = . . . ,y c ) € (F^ yields a distinct vector x = (x\, . . . ,Xk) € (FJf) 
formed by letting x = Ky that satisfies M l x = M l Ky = 0. Therefore, because of Theorem 31, 
the probability that uniformly chosen yi,--- ,y c € FJJ yield x = [x\, . . . ,Xk) such that / induces 
(M l ,a l ) at x is at least 5j?(e). The probability that D does not contain such yi,...,y c is at most 
(1 - 5) d / c < e 5 ^ d l c < 1/3 if we choose d = 0{c/5 F {e)) = 0{k T {e)/5 F {e)). Thus with probability 
at least 2/3, span(D) contains x±, . . . , x^ such that / induces (M l , a 1 ) at x = (xi, . . . , Xk), making 
the tester reject. ■ 

To prove Theorem 31, the main ingredient that changes is the counting lemma. 

Lemma 32 (Counting Lemma for Complexity 1) For every r/ € (0, 1) and integer k > 2, 
there exist 7 = 7jw(f7, k) and 5 = S vsijli k) such that the following is true. Suppose M is an 
m x k matrix of complexity 1 and rank m < k, a G {0, l} k is a tuple, H is a subspace o/F^, and 
f : F2 — > {0, 1} is a function. Furthermore, suppose there are k not necessarily distinct elements 
ui, . . . , life S Frj/iT such that Mu = where u = (u\, . . . , uj.), f^ Ui '■ H — > {0, 1} is ^-uniform for 
all i E [k], and p(f^ Ul ) is at least r/ if o~(i) = 1 and at most 1 — r/ if a(i) = for all i G [k]. Then, 
there are at least 5\H\ k ~ m many k-tuples x = (x\,X2, ■ ■ ■ ,Xf;), with each xi £ ui + H, such that f 
induces (M, a) at x. 

Lemma 32 is a special case of the Generalized von Neumann Theorem (Proposition 7.1 in 
[GT08]). The rest of the proof is a straightforward modification of Section 3.1. Namely, whenever 
the old proof requires k elements or k cosets x\, . . . ,Xk to satisfy the equation x\ + • • • + x^ = 0, 
the new proof would require that they satisfy the equation Mx = where x = (xi, . . . , x^). 
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4 Characterization of natural one-sided testable properties 



We now turn to showing Theorem 10 which states that for linear-invariant properties, testability 
with a one-sided error oblivious tester is equivalent to the property being semi subspace-hereditary 
(recall here Definition 9). 

First we formalize the discussion from the introduction regarding the fact that it is always 
possible to assume that the testing algorithm for a one-sided testable linear-invariant property 
makes its decision only by querying the input function on a random linear subspace of constant 
dimension. 

Proposition 33 Let V be a linear invariant property, and let T be an arbitrary one-sided tester 
for V with query complexity d(e,n). Then, there exists a one-sided tester T' for V that selects a 
random subspace H of dimension d(e,n), queries the input on all points of H , and decides based on 
the oracle answers, the value of e and n, and internal randomness 12 . Note that T' is non-adaptive 
and has query complexity 2 d ( e ' n ^ . 

Proof: Consider a tester T2 that acts as follows. If the tester T on the input makes queries 
xi, . . . , Xd, then T' queries all points in span(xi, . . . , Xd) but makes its decision based on x±, . . . , Xd 
just as T does. Clearly, T2 is also a one-sided tester for V and with query complexity at most 2 d ^ . 

Now, define a tester T' as follows. Given oracle access to a function / : F2 — > {0, 1}, T 1 first 
selects uniformly at random a non-singular linear transformation L : F2 — > F2, and then invokes 
T2 providing it with oracle access to the function / o L. That is, when T2 makes query x, then 
algorithm T' makes query L(x). We argue that the sequence of queries made by T' are the elements 
of a uniformly chosen random subspace of dimension at most d(e). To see this, fix the input / and 
the randomness of Ti- Then, for each i € [2 rf ( e )] for which the z'th query, Xj, made by T2 is linearly 
independent of the previous i — \ queries, x%, . . . , Xi-i, it's the case that L(xj) is a uniformly chosen 
random element from outside span(L(xi), . . . , L{xi-\j). So, for every fixing of the random coins of 
T2, the queries made by T' span a uniformly chosen subspace of dimension at most d(e), and hence, 
this is also the case when the coins are not fixed. T" is a one-sided tester for V because if / G V, 
then / o L € V by linear invariance, and if / is e-far from J 7 , then / o L is also e-far from V because 
L is a permutation on Fj. ■ 

An oblivious tester, as defined in Definition 8, differs from the tester T' of the above proposition 
in that the dimension of the selected subspace and the decision made by the tester are not allowed 
to depend on n. As argued there, it is very reasonable to expect natural linear-invariant properties 
to have such testers, and indeed, prior works have already implicitly restricted themselves in this 
way. 

We can now proceed with the proof of Theorem 10. 

Proof of Theorem 10: Let us first prove the forward direction of the theorem. Note that for this 
direction, we do not need to assume the truth of Conjecture 4. Given a linear-invariant property V 
that can be tested with one-sided error by an oblivious tester, we will build a subspace-hereditary 
property "H containing V, by identifying a (possibly infinite) collection of matrices M % and binary 
strings & l such that H is equivalent to the property of being {(M*,cr*)}j- free. 

12 Note here, we leave open the possibility that the decision of the tester may not be based only on properties of 
the selected subspace. This gap can be resolved using the same techniques as used by [GT03] for the graph case, but 
this point is not relevant for our purposes and so we do not elaborate more here. 
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Let S consist of the pairs (H,S), where H is a subspace of F2 and S C H is a subset, that 
satisfy the following two properties: (1) dim(H) = d(e) for some e, and (2) if for this e, the tester 
rejects its input with some positive probability when the evaluation of its input on the sampled 
subspace is I5. For (H,S) € S let d = &\m(H). Consider the matrix Ah over F2 with each row 
representing an element of H in some fixed basis. Notice that An is a (2^ x £)-sized matrix. Define 
Mh, a matrix over F2 of size (2 £ - 1) x 2 £ , such that M H A H = 0. Finally, for each i € [2 f ] define 
°"s(*) = Is^i)) where Xj is the element represented in the i'th row of Ah- Let .M be the set of 
pairs (Mff,crs) obtained in this way from every (H, S) € S. 

We now proceed to verify that T~L satisfies the conditions of Definition 9. To show that V is 
■M-free, let / £ V n , and suppose that there exists (Mjj,crs) € .M such that (Mh,cts) •->■ /, for 
some e, and for some -ff with dim(i^) = d(e) and S C. H. We show that / is rejected with some 
positive probability, a contradiction to the fact that the test is one-sided. If (Mh,o~s) is induced 
by / at (xi, . . . , x 2 d{e))i then these elements necessarily span a (f(e)-dimensional subspace so that 
the function restricted to that subspace is I5 o L for some linear transformation L : FJ? — >• F^ 1 ^ 
(determined by the choice of basis that was used to represent H). Thus, this immediately implies 
by the definition of (Mh,(Js) that the tester rejects / with positive probability. 

To verify the second part of the Definition 9, let M(e) = d(e). Suppose / : F?? — > {0, 1}, with 
n > M(e) is e-far from satisfying V. In this case, in order for the tester to reject / with positive 
probability, it must select a (i(e)-dimensional subspace H so that the restriction to H equals the 
indicator function on S (upto a linear transformation), for some (H,S) £ S. Therefore T is not 
M-free, and thus T g H. 

It remains to show the opposite direction of Theorem 10. We here assume Conjecture 4 that 
every subspace-hereditary property V is testable by a one-sided tester. Our first observation that, 
in this case, it is actually testable by an oblivious one-sided tester. Namely, we show that the clearly 
oblivious tester, which checks whether the input function restricted to a random linear subspace 
satisfies V or not, is a valid tester. We need to argue that if a non-oblivious tester rejects input 
/ that is e-far from V by querying its values on a random d(e)-dimensional subspace (we already 
know the tester is of this type from Proposition 33), then with high probability, the input function 
restricted to a random 3d(e)-dimensional subspace does not satisfy the property V . Suppose it did. 
But then, if the original tester first uniformly selected a 3d(e)-dimensional subspace H and then 
uniformly selected a d(e)-dimension subspace H' inside it, and ran its decision based on f\jj', it will 
accept the input with large probability, which is a contradiction to the soundness of the tester since 
H" is a uniformly distributed d(e)-dimensional subspace. Thus, for a testable subspace-hereditary 
property, we can assume that the tester simply checks for V on the sampled subspace, and is hence, 
oblivious to the value of n. This argument is analogous to one of Alon for graph properties, reported 
in [GT03]. 

Now, assuming that every subspace-hereditary property is testable by an oblivious one-sided 
tester (Conjecture 4), we wish to show that every semi subspace-hereditary property is testable by 
an oblivious one-sided tester. Let V be a a semi subspace-hereditary property and let T~i be the 
subspace-hereditary property associated to V in Definition 9. By our assumption, % has a one-sided 
tester T', which on input e makes Q'(e) queries and rejects inputs e-far from % with probability 
2/3. The tester T for V makes Q(e) = max(Q'(e/2), 2 M ^^ 2 ^) queries (where M(-) comes from 
Definition 9) and proceeds as follows. If the size of the input is at most Q(e), then by definition, T 
receives the evaluation of the function all of the input and in this case, it simply checks if the input 
belongs to V. Otherwise T emulates T' with distance parameter e/2 and accepts if and only if T' 
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accepts. 

Notice that T is one-sided. Indeed, if the input / satisfies V then / E H and thus T' always 
accepts, causing T to always accept. To prove soundness, we first argue that if / is e-far from V 
then it is e/2-far from H. Suppose otherwise, and modify / in at most an e/2 fraction of the domain 
in order to obtain a function g £ %. Thus g is still e/2-far from V, and by Definition 9 g g" 7i, a 
contradiction. Finally, since / is e/2-far from % and since T' mistakenly accepts such inputs with 
probability at most 1/3 so does T' . ■ 

5 Concluding Remarks and Open Problems 

Obviously, the main open problem we would like to see resolved is Conjecture 4. One appealing 
way to prove the conjecture would be to proceed as we have but to obtain a stronger notion of 
pseudorandomness in the regularity lemma. The notion of e-uniformity obtained from Green's reg- 
ularity lemma corresponds to the Gowers U 2 norm, whereas in order to be able to prove Conjecture 
4 in its full generality, we would presumably need a similar regularity lemma with respect to the 
Gowers U k norm [GowOl] for any fixed k. Such a higher order regularity lemma has been very 
recently obtained by Green and Tao [GT10] over the integers and over fields of large characteristic. 
However, it is not yet available over F2, as the inverse conjectures for the Gowers norms over F2 
have not yet been completely clarified [GrelO]. 

Let us mention some other observations and open problems related to this work. 

• As we have mentioned in Subsection 1.4, it is not too hard to construct linear-invariant 
properties which are not testable. Actually, there are properties of this type that cannot be 
tested with o(2 n ) queries. One example can be obtained from a variant of an argument used 
in [GGR98] as follows; it is shown in [GGR98] (see Proposition 4.1) that for every n there 
exists a property of Boolean functions that contains 2 To 2 ™ of the Boolean functions over FJ? 
and cannot be tested with less than <^>2™ queries. This family of functions is not necessarily 
linear invariant, so we just "close" it under linear transformation, by adding to the property 
all the linear-transformed such functions. Since the number of these linear transformation is 
bounded by 2 n (corresponding to all possible n x n matrices over F2) we get that the new 
property contains at most 2 n2 2To 2 < 2s 2 Boolean functions. One can verify that since this 
new family contains a small fraction of all possible functions the argument of [GGR98] caries 
over, and the new property cannot be tested with o(2 n ) queries. 

• The upper bound one obtains from the general result given in Theorem 3 is terrible in terms 
of its dependence on 1/e. A natural open problem would be to find a characterization of these 
properties that can be tested with a number of queries that depends polynomially on e. This, 
however, seems to be a very hard problem. Even if the only forbidden equation is x + y = z 
it is not known if such an efficient test exists. This question was raised by Green [Gre05]; see 
[BX10] for current best bounds. 

• Our result here gives a (conjectured) characterization of the linear-invariant properties of 
Boolean functions that can be tested with one-sided error. It is of course natural to try to 
extend our framework to other families of properties, characterized by other or more general 
invariances. For instance, can we carry out a full characterization for testable affine invariant 
properties of Boolean functions on the hypercube? 
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• It would be valuable to understand formally why the technology developed for handling graph 
properties can be extended so naturally to linear-invariant properties. This "coincidence" 
seems part of a larger trend in mathematics where claims about subsets find analogs in 
claims about vector subspaces. See [Coh04] for an interesting attempt to shed light on this 
puzzle. 
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A Proofs omitted from Section 1 

Characterization of Reed Muller codes by forbidding systems of induced equations 

First recall that Reed Muller codes of order d are defined as 

TZM(d) = | / : F^ F 2 : /(*) = £ J] Xi . 

[ Sc[n],\S\<d ieS 

The most common characterization of HA4(d) (see for example [AKK + 05]) is that / G lZM{d) 
if and only if / satisfies 

£ /(« + J>i) =0, forall (a,a u . . . ,a d+1 ) G (F£) d+2 . 

Sc[n],\S\<d+l V ieS / 

We use this description to obtain a matrix M G F^ 2 ^ 2)x(2 ) a collection of a 1 G 
{0, l} 2d+1 such that KM(d) is {(M, o- l )}i- free. Intuitively, we want M to encode all the linear 
relations between the elements of the set A = {a + J2ies a «}o<|S|<d+i; an d we want to use the a l, s 
to enforce the fact / should evaluate to 1 on an even number of elements of A. 

More exactly, assume that B = {a, a + a±, . . . , a + ctd+i} are linearly independent. For every 
(3 G A — B, add to M the row which is the vector representing /3 in the basis B. Further, consider 
all the a 1 G {0, l} 2d+1 such that \{j : cr*- = 1}| is odd. Clearly the number of such <t*'s is finite, and 
the patterns allowed by forbidding all (M, a 1 ) are only those that satisfy the above characterization. 
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Finally, notice that setting d = 1 the resulting matrix M contains only one row, and thus 
Theorem 3 applies to testing linearity. 

We conclude with the proof of Proposition 6 which was also omitted from the Introduction. 

Proof of Proposition 6: In one direction, it is easy to check that J-"-freeness is a subspace- 
hereditary linear-invariant property, for any fixed family T . 

Now, we show the other direction. For a subspace-hereditary linear-invariant property V, let 
Obs denote the collection of pairs (d,S), where d > 1 is an integer and S C ¥ d is a subset, such 
that Is does not have property V and is minimal with respect to restriction to subspaces. In other 
words, (d, S) is contained in Obs iff Is G" Vd but for any vector subspace U C ¥ d of dimension 
d 1 < d, ls\ u G Vd> where S\jj C U is the restriction of S to U. 

For every (d, S) G Obs, we construct a matrix Md and a tuple as such that any / with property V 
is {Md, erg) -free. Define Ad to be the 2 rf -by-<i matrix over F2, where each of the 2 d rows corresponds 
to a distinct element of ¥ d represented using some choice of bases. Now, define Md to be a (q d — d)- 
by-q d matrix over F, such that MdAd = and rank(M^) = q d —d. Define as as (a(l), a(2), . . . , a(2 d )) 
where a(i) = ls(%i) with X{ being the element of ¥ d represented in the ith row of Ad- We observe 
now that any / : Fj — > {0, 1} having property V is (Md, as)-free. Suppose the opposite, so that 
there exists x = (x\, . . . ,x q d) G (F^)^ satisfying Mx = and /(xj) = a(i). Then, by definition of 
Md, the X\,... ,x 2 d are the elements of a <i-dimensional subspace V over F2, and by definition of 
C5i Sf\v = where 5/ is the support of /. Thus f\y £ V which is a contradiction to the fact that 
/ has property V because V is subspace-hereditary. 

Finally, define T-p = {(Md,as)}- We have just seen that any / having property V is T-p-free. 
On the other hand, suppose / does not have property V . Then, because of heredity, there must be 
a <i-dimensional subspace V such that the support of f\y is isomorphic to S for some (d, S) G Obs 
under linear transformations, which means by the same argument as above, that / will not be 
(M d ,a s )-free. ■ 
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